From data pipelines to FAIR data infrastructures: A vision for the new horizons of bio- and geodiversity data for scientific research

نویسندگان

چکیده

Natural science collections are vast repositories of bio- and geodiversity specimens. These collections, originating from natural history cabinets or expeditions, increasingly becoming unparalleled sources data facilitating multidisciplinary research (Meineke et al. 2018, Heberling 2019, Cook 2020, Thompson 2021). Due to various global mobilization digitisation efforts (Blagoderov 2012,Nelson Ellis 2018), this digitised information about specimens includes database records along with two/three-dimensional images, sonograms, sound video recordings, computerised tomography scans, machine-readable texts labels on the as well media items notes related discovery sites acquisition (Hedrick 2020,Phillipson 2022). The scope practice specimen gathering also evolving. term extended was coined refer associated extending beyond singular physical object other digital entities such chemical composition, genetic sequence species data. Thus becomes an interconnected network resources that have incredible potential enhance integrative data-driven (Webster 2017,Lendemer 2019,Hardisty practices reflect role curatorial life-cycle starting initial material sampling process downstream analysis. We seeing growing acknowledgement disparate domain specific elements prevent interdisciplinarity which is crucial for a holistic understanding biodiversity climate crisis (Hicks 2010, Craven Folk Siniscalchi not just rows in pipelines going one repository another. They become self-describing artefacts can revolutionise how machines interpret work Within context, Distributed System Scientific Collections (DiSSCo), new European Research Infrastructure envisions infrastructure based FAIR Digital Objects (FDO) unify more than 170 under common FAIR-compliant (Findable, Accessible, Interoperable, Reusable) (Wilkinson 2016) access curation policies practices. DiSSCo’s key element achieving implementation Specimen (a FDO) closely aligns idea behind – FDO acts surrogate collection influenced by conversations around Object Architecture (De Smedt Islam 2020,Hardisty 2020). main purpose talk explain vision create only take advantage existing databases but at same time provide support innovative services AI twinning. With scientific use cases mind, will highlight few components (persistent identifiers, metadata, ontologies) within collaborative modelling activity specification. template specifying should look so DiSSCo build service ecosystem FDOs (Addink give examples envisioned help image feature extraction, model training (Grieb 2021,Hardisty 2022) twinning (Schultes believe exciting paradigm powered both humans accelerate From objects curated over hundred years, we developed pipelines, aggregators (Barberousse Now solutions where these enable wider research.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

a new approach to credibility premium for zero-inflated poisson models for panel data

هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...

15 صفحه اول

A Vision for Global Research Data Infrastructures

New high-throughput scientific instruments, telescopes, satellites, accelerators, supercomputers, sensor networks, and running simulations are generating massive amounts of data. In order to be able to exploit these huge volumes of data, a new type of e-infrastructure, the Global Research Data Infrastructure (GRDI), must be developed for harnessing the accumulating data and knowledge produced b...

متن کامل

metrics for the detection of changed buildings in 3d old vector maps using als data (case study: isfahan city)

هدف از این تحقیق، ارزیابی و بهبود متریک های موجود جهت تایید صحت نقشه های قدیمی سه بعدی برداری با استفاده از ابر نقطه حاصل از لیزر اسکن جدید شهر اصفهان می باشد . بنابراین ابر نقطه حاصل از لیزر اسکنر با چگالی حدودا سه نقطه در هر متر مربع جهت شناسایی عوارض تغییر کرده در نقشه های قدیمی سه بعدی استفاده شده است. تمرکز ما در این تحقیق بر روی ساختمان به عنوان یکی از اصلی ترین عارضه های شهری می باشد. من...

A New Nonparametric Regression for Longitudinal Data

In many area of medical research, a relation analysis between one response variable and some explanatory variables is desirable. Regression is the most common tool in this situation. If we have some assumptions for such normality for response variable, we could use it. In this paper we propose a nonparametric regression that does not have normality assumption for response variable and we focus ...

متن کامل

A new approach for data visualization problem

Data visualization is the process of transforming data, information, and knowledge into visual form, making use of humans’ natural visual capabilities which reveals relationships in data sets that are not evident from the raw data, by using mathematical techniques to reduce the number of dimensions in the data set while preserving the relevant inherent properties. In this paper, we formulated d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Research Ideas and Outcomes

سال: 2022

ISSN: ['2367-7163']

DOI: https://doi.org/10.3897/rio.8.e93816